Solving Factored MDPs via Non-Homogeneous Partitioning
نویسندگان
چکیده
This paper describes an algorithm for solving large state-space MDPs (represented as factored MDPs) using search by successive refinement in the space of non-homogeneous partitions. Homogeneity is defined in terms of bisimulation and reward equivalence within blocks of a partition. Since homogeneous partitions that define equivalent reduced state-space MDPs can have a large number of blocks, we relax the requirement of homogeneity. The algorithm constructs approximate aggregate MDPs from non-homogeneous partitions, solves the aggregate MDPs exactly, and then uses the resulting value functions as part of a heuristic in refining the current best non-homogeneous partition. We outline the theory motivating the use of this heuristic and present empirical results and comparisons.
منابع مشابه
Solving factored MDPs using non-homogeneous partitions
We present an algorithm for aggregating states in solving large MDPs (represented as factored MDPs) using search by successive re nement in the space of nonhomogeneous partitions. Homogeneity is de ned in terms of stochastic bisimulation and reward equivalence within blocks of a partition. Since homogeneous partitions that de ne equivalent reduced-state-space MDPs can have a large number of blo...
متن کاملModel Reduction Techniques for Computing
We present a method for solving implicit (factored) Markov decision processes (MDPs) with very large state spaces. We introduce a property of state space partitions which we call-homogeneity. Intuitively, an-homogeneous partition groups together states that behave approximately the same under all or some subset of policies. Borrowing from recent work on model minimization in computer-aided soft...
متن کاملModel Reduction Techniques for Computing ApproximatelyOptimal Solutions for Markov Decision
We present a method for solving implicit (factored) Markov decision processes (MDPs) with very large state spaces. We introduce a property of state space partitions which we call-homogeneity. Intuitively, an-homogeneous partition groups together states that behave approximately the same under all or some subset of policies. Borrowing from recent work on model minimization in computer-aided soft...
متن کاملContext-specific Sign-propagation in Qualitative Probabilistic Networks
This paper describes an algorithm for solving large state-space MDPs (represented as factored MDPs) using search by successive refinement in the space of non-homogeneous partitions. Homogeneity is defined in terms of bisimulation and reward equivalence within blocks of a partition. Since homogeneous partitions that define equivalent reduced state-space MDPs can have a large number of blocks, we...
متن کاملSolving Factored MDPs with Large Action Space Using Algebraic Decision Diagrams
We describe an algorithm for solving MDPs with large state and action spaces, represented as factored MDPs with factored action spaces. Classical algorithms for solving MDPs are not effective since they require enumerating all the states and actions. As such, model minimization techniques have been proposed, and specifically, we extend the previous work on model minimization algorithm for MDPs ...
متن کامل